Detection and removal of barcode swapping in single-cell RNA-seq data
Authors
Abstract
Multiplexing is a widely used procedure that allows multiple DNA libraries to be pooled together for efficient sequencing. However, recent reports suggest that the DNA barcodes that label different libraries can “swap” on patterned flow-cell Illumina sequencing machines, including the HiSeq 4000, HiSeq X, and NovaSeq, thereby mislabelling molecules [1, 2]. This may compromise many types of -omic assays, but it is particularly problematic for single-cell RNA-seq (scRNA-seq), where many libraries are multiplexed together. A number of widely used plate-based scRNA-seq library preparation methods isolate and process individual cells in wells of a microwell plate, before performing library preparation in parallel [3]. A unique combination of sample barcodes labels the library of each cell, typically with one barcode at each end of a cDNA molecule. One barcode provides a row index for each cell on the microwell plate and the other barcode provides a column index. Barcode swapping therefore moves transcripts between cells. We generated a dataset (see Supplementary Files, “Richard data”) where two plates of single-cell libraries were multiplexed for sequencing on the HiSeq 4000 using two mutually exclusive barcode sets. We expect to only observe reads ...
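The row/column barcoding scheme and the two-plate design described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' method: the barcode names, the `classify` and `summarize` helpers, and the example reads are all hypothetical. It only shows the underlying idea that, when two plates use mutually exclusive barcode sets, a read whose row and column barcodes come from different plates cannot correspond to a real cell.

```python
from collections import Counter

# Hypothetical barcode sets (placeholders, not real index sequences).
# Each plate has its own row and column barcodes; the sets do not overlap.
PLATES = {
    "A": {"row": {"R1", "R2"}, "col": {"C1", "C2"}},
    "B": {"row": {"R3", "R4"}, "col": {"C3", "C4"}},
}

def classify(row_bc, col_bc, plates):
    """Return the plate name if both barcodes belong to the same plate,
    otherwise 'unexpected' (a combination no real well can produce)."""
    for name, sets in plates.items():
        if row_bc in sets["row"] and col_bc in sets["col"]:
            return name
    return "unexpected"

def summarize(read_pairs, plates):
    """Count reads per plate and reads with unexpected barcode combinations."""
    return Counter(classify(r, c, plates) for r, c in read_pairs)

# Made-up (row barcode, column barcode) pairs, one per read.
reads = [("R1", "C1"), ("R2", "C1"), ("R3", "C4"), ("R1", "C3"), ("R4", "C2")]
counts = summarize(reads, PLATES)
print(counts)
```

A check of this kind only flags combinations that mix the two plates. A swap between two wells of the same plate yields a valid combination and would go unnoticed here.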
Related resources
A Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data
Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. ...
I-42: Origins and Differentiation of Somatic Progenitors of The Mammalian Gonad Revealed by Single Cell RNA-Seq
I-13: Transcriptome Dynamics of Human and Mouse Preimplantation Embryos Revealed by Single Cell RNA-Sequencing
Background: Mammalian preimplantation development is a complex process involving dramatic changes in the transcriptional architecture. However, the crucial transcriptional network and key hub genes that regulate the progression of preimplantation embryos remain unclear. Materials and Methods: Through single-cell RNA-sequencing (RNA-seq) of both human and mouse preimplantation embryos, ...
scPipe: a flexible data preprocessing pipeline for single-cell RNA-sequencing data
Single-cell RNA sequencing (scRNA-seq) technology allows researchers to profile the transcriptomes of thousands of cells simultaneously. Protocols that incorporate both designed and random barcodes to label individual cells and molecules have greatly increased the throughput of scRNA-seq, but give rise to a more complex data structure. There is a need for new tools that can handle the various b...